Dataset info
| Number of variables | 29 |
|---|---|
| Number of observations | 4441 |
| Missing cells | 0 (0.0%) |
| Duplicate rows | 0 (0.0%) |
| Total size in memory | 4.6 MiB |
| Average record size in memory | 1.1 KiB |
Variable types
| NUM | 17 |
|---|---|
| CAT | 11 |
| URL | 1 |
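The overview figures above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch that uses only numbers quoted in the tables; the variable names are illustrative and not part of the report.

```python
# Figures copied from the overview tables.
n_observations = 4441
total_size_mib = 4.6
type_counts = {"NUM": 17, "CAT": 11, "URL": 1}

# Average record size: total memory divided by the row count (1 MiB = 1024 KiB).
avg_record_kib = total_size_mib * 1024 / n_observations

# The per-type counts should add up to the number of variables.
n_variables = sum(type_counts.values())

print(f"{avg_record_kib:.1f} KiB per record")
print(f"{n_variables} variables")
```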
Reproduction info
| Date of analysis | 2020-01-17 12:40:30.800175 |
|---|---|
| Version | pandas-profiling v2.4.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download Configuration | config.yaml |
Warnings
| actor_1_facebook_likes is highly skewed (γ1 = 20.39377482) | Skewed |
| actor_1_name has a high cardinality: 1807 distinct values | Warning |
| actor_2_name has a high cardinality: 2673 distinct values | Warning |
| actor_3_facebook_likes has 52 (1.2%) zeros | Zeros |
| actor_3_name has a high cardinality: 3156 distinct values | Warning |
| budget is highly skewed (γ1 = 25.93138816) | Skewed |
| country has a high cardinality: 54 distinct values | Warning |
| director_facebook_likes has 753 (17.0%) zeros | Zeros |
| director_name has a high cardinality: 2100 distinct values | Warning |
| facenumber_in_poster has 1890 (42.6%) zeros | Zeros |
| genres has a high cardinality: 851 distinct values | Warning |
| movie_facebook_likes has 2024 (45.6%) zeros | Zeros |
| movie_title has a high cardinality: 4441 distinct values | Warning |
| plot_keywords has a high cardinality: 4437 distinct values | Warning |
| cast_total_facebook_likes is highly correlated with actor_1_facebook_likes | High Correlation |
| actor_1_facebook_likes is highly correlated with cast_total_facebook_likes | High Correlation |
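The skewness and cardinality warnings above behave like simple threshold checks. The sketch below reproduces them under two assumed cut-offs (skewness above 20, more than 50 distinct values). Both are inferred from which columns this report does and does not flag, and are not taken from the tool's configuration; the function names are hypothetical.

```python
# Assumed thresholds, inferred from this report rather than from config.yaml.
SKEWNESS_THRESHOLD = 20     # aspect_ratio (19.14) is not flagged, actor_1_facebook_likes (20.39) is
CARDINALITY_THRESHOLD = 50  # language (37 values) is not flagged, country (54 values) is


def skewness_warning(name, skewness):
    """Return a 'Skewed' message if the absolute skewness exceeds the threshold."""
    if abs(skewness) > SKEWNESS_THRESHOLD:
        return f"{name} is highly skewed (γ1 = {skewness})"
    return None


def cardinality_warning(name, distinct_count):
    """Return a 'Warning' message if a categorical column has too many levels."""
    if distinct_count > CARDINALITY_THRESHOLD:
        return f"{name} has a high cardinality: {distinct_count} distinct values"
    return None


print(skewness_warning("budget", 25.93138816))
print(cardinality_warning("country", 54))
```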
actor_1_facebook_likes
Real number (ℝ≥0)
| Distinct count | 819 |
|---|---|
| Unique (%) | 18.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6910.11011 |
|---|---|
| Minimum | 0 |
| Maximum | 640000 |
| Zeros | 9 |
| Zeros (%) | 0.2% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 144 |
| Q1 | 660 |
| median | 1000 |
| Q3 | 12000 |
| 95-th percentile | 24000 |
| Maximum | 640000 |
| Range | 640000 |
| Interquartile range (IQR) | 11340 |
Descriptive statistics
| Standard deviation | 14779.6342 |
|---|---|
| Coefficient of variation (CV) | 2.138842069 |
| Kurtosis | 786.7349524 |
| Mean | 6910.11011 |
| Mean Absolute Deviation (MAD) | 7864.497057 |
| Skewness | 20.39377482 |
| Sum | 30687799 |
| Variance | 218437587.2 |
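Several of the statistics above follow directly from others in the same tables, so they can serve as a quick sanity check on the report. A minimal sketch, using only values copied from the tables above (the dictionary keys are illustrative names):

```python
# Values copied from the statistics tables above.
n = 4441
total = 30687799
std = 14779.6342
q1, q3 = 660, 12000
minimum, maximum = 0, 640000

mean = total / n
derived = {
    "mean": mean,                 # Sum / number of observations
    "range": maximum - minimum,   # Maximum - Minimum
    "iqr": q3 - q1,               # Q3 - Q1
    "cv": std / mean,             # Standard deviation / Mean
    "variance": std ** 2,         # Standard deviation squared
}

for key, value in derived.items():
    print(f"{key}: {value:.10g}")
```

The results agree with the reported mean, range, IQR, coefficient of variation and variance up to the rounding of the printed inputs.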
| Value | Count | Frequency (%) | |
| 1000 | 404 | 9.1% | |
| 11000 | 204 | 4.6% | |
| 2000 | 179 | 4.0% | |
| 3000 | 141 | 3.2% | |
| 12000 | 131 | 2.9% | |
| 13000 | 121 | 2.7% | |
| 14000 | 119 | 2.7% | |
| 18000 | 105 | 2.4% | |
| 10000 | 104 | 2.3% | |
| 22000 | 77 | 1.7% | |
| Other values (809) | 2856 | 64.3% |
| Value | Count | Frequency (%) | |
| 0 | 9 | 0.2% | |
| 2 | 5 | 0.1% | |
| 3 | 2 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 640000 | 1 | < 0.1% | |
| 260000 | 1 | < 0.1% | |
| 164000 | 2 | < 0.1% | |
| 137000 | 2 | < 0.1% | |
| 87000 | 8 | 0.2% |
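The percentage columns in these tables appear to be the count divided by the 4441 observations, rounded to one decimal, with "< 0.1%" shown when a non-zero count would round to 0.0%. That rule is inferred from the rows above and is not a documented behaviour of the tool; `format_pct` is a hypothetical helper.

```python
def format_pct(count, total):
    """Format count/total as a percentage the way the tables above appear to."""
    text = f"{100 * count / total:.1f}%"
    # A non-zero count that rounds to 0.0% is shown as "< 0.1%".
    if count > 0 and text == "0.0%":
        return "< 0.1%"
    return text


n_observations = 4441
print(format_pct(819, n_observations))  # distinct count of the column above
print(format_pct(9, n_observations))    # zeros
print(format_pct(2, n_observations))    # a value occurring twice
```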
actor_1_name
Categorical
| Distinct count | 1807 |
|---|---|
| Unique (%) | 40.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| Robert De Niro | 47 |
|---|---|
| Johnny Depp | 36 |
| Nicolas Cage | 32 |
| J.K. Simmons | 29 |
| Denzel Washington | 29 |
| Other values (1802) | 4268 |
| Value | Count | Frequency (%) | |
| Robert De Niro | 47 | 1.1% | |
| Johnny Depp | 36 | 0.8% | |
| Nicolas Cage | 32 | 0.7% | |
| J.K. Simmons | 29 | 0.7% | |
| Denzel Washington | 29 | 0.7% | |
| Matt Damon | 28 | 0.6% | |
| Bruce Willis | 28 | 0.6% | |
| Steve Buscemi | 27 | 0.6% | |
| Harrison Ford | 27 | 0.6% | |
| Liam Neeson | 26 | 0.6% | |
| Other values (1797) | 4132 | 93.0% |
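A frequency table of this shape (the most common values plus one aggregated "Other values (k)" row) can be built with a counter. This is a minimal sketch and not the tool's own code; `frequency_table` is a hypothetical helper.

```python
from collections import Counter


def frequency_table(values, top=10):
    """Return (label, count, percent) rows for the top values plus an 'Other values' row."""
    counts = Counter(values)
    total = sum(counts.values())
    rows = [(value, count, 100 * count / total) for value, count in counts.most_common(top)]
    shown = sum(count for _, count, _ in rows)
    n_other = len(counts) - len(rows)  # distinct values not listed individually
    if n_other > 0:
        rest = total - shown
        rows.append((f"Other values ({n_other})", rest, 100 * rest / total))
    return rows


sample = ["a"] * 3 + ["b"] * 2 + ["c", "d"]
for label, count, pct in frequency_table(sample, top=2):
    print(f"| {label} | {count} | {pct:.1f}% |")
```

With 1807 distinct names and ten rows listed, the remaining row covers 1807 - 10 = 1797 values, which matches the table above.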
Composition
| Contains chars | True |
|---|---|
| Contains digits | True |
| Contains whitespace | True |
| Contains non-words | True |
Length
| Max length | 27 |
|---|---|
| Mean length | 13.19229903 |
| Min length | 4 |
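The Composition and Length tables summarise the string values of a categorical column. The sketch below shows one way such flags could be computed. The regular expressions are assumptions (letters for "chars", `\d`, `\s` and `\W` for the rest) chosen to be consistent with the flags shown in this report; they are not taken from the tool's source.

```python
import re


def text_profile(values):
    """Summarise character classes and lengths for a list of strings."""
    lengths = [len(v) for v in values]
    return {
        "contains_chars": any(re.search(r"[A-Za-z]", v) for v in values),
        "contains_digits": any(re.search(r"\d", v) for v in values),
        "contains_whitespace": any(re.search(r"\s", v) for v in values),
        # \W matches anything that is not a letter, digit or underscore,
        # so whitespace and punctuation both count as non-word characters.
        "contains_non_words": any(re.search(r"\W", v) for v in values),
        "max_length": max(lengths),
        "mean_length": sum(lengths) / len(lengths),
        "min_length": min(lengths),
    }


print(text_profile(["Robert De Niro", "Johnny Depp", "J.K. Simmons"]))
```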
actor_2_facebook_likes
Real number (ℝ≥0)
| Distinct count | 897 |
|---|---|
| Unique (%) | 20.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1748.402387 |
|---|---|
| Minimum | 0 |
| Maximum | 137000 |
| Zeros | 24 |
| Zeros (%) | 0.5% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 44 |
| Q1 | 321 |
| median | 626 |
| Q3 | 936 |
| 95-th percentile | 11000 |
| Maximum | 137000 |
| Range | 137000 |
| Interquartile range (IQR) | 615 |
Descriptive statistics
| Standard deviation | 4178.221576 |
|---|---|
| Coefficient of variation (CV) | 2.389736829 |
| Kurtosis | 253.2704876 |
| Mean | 1748.402387 |
| Mean Absolute Deviation (MAD) | 2089.040018 |
| Skewness | 9.904953976 |
| Sum | 7764655 |
| Variance | 17457535.53 |
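The MAD value above (about 2089) cannot be a median-based deviation: half of the observations lie between Q1 = 321 and Q3 = 936, within 310 of the median of 626, so the median absolute deviation is at most 310. The figure is consistent with a mean absolute deviation around the mean instead, which is what pandas' `Series.mad()` computed in versions of that era. That the report uses that method is an assumption here. A small sketch of the difference on a skewed sample:

```python
from statistics import mean, median


def mean_absolute_deviation(values):
    """Average distance from the mean (sensitive to outliers)."""
    center = mean(values)
    return mean([abs(v - center) for v in values])


def median_absolute_deviation(values):
    """Median distance from the median (robust to outliers)."""
    center = median(values)
    return median([abs(v - center) for v in values])


skewed = [0, 1, 2, 3, 1000]  # one extreme value, like the like-count columns
print(mean_absolute_deviation(skewed))    # dominated by the outlier
print(median_absolute_deviation(skewed))  # barely affected by it
```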
| Value | Count | Frequency (%) | |
| 1000 | 288 | 6.5% | |
| 11000 | 105 | 2.4% | |
| 2000 | 93 | 2.1% | |
| 3000 | 72 | 1.6% | |
| 10000 | 45 | 1.0% | |
| 13000 | 39 | 0.9% | |
| 14000 | 38 | 0.9% | |
| 826 | 35 | 0.8% | |
| 4000 | 32 | 0.7% | |
| 12000 | 29 | 0.7% | |
| Other values (887) | 3665 | 82.5% |
| Value | Count | Frequency (%) | |
| 0 | 24 | 0.5% | |
| 2 | 8 | 0.2% | |
| 3 | 7 | 0.2% | |
| 4 | 6 | 0.1% | |
| 5 | 8 | 0.2% |
| Value | Count | Frequency (%) | |
| 137000 | 1 | < 0.1% | |
| 29000 | 1 | < 0.1% | |
| 27000 | 2 | < 0.1% | |
| 25000 | 2 | < 0.1% | |
| 23000 | 6 | 0.1% |
actor_2_name
Categorical
| Distinct count | 2673 |
|---|---|
| Unique (%) | 60.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| Morgan Freeman | 18 |
|---|---|
| Charlize Theron | 14 |
| Brad Pitt | 13 |
| Meryl Streep | 11 |
| Adam Sandler | 10 |
| Other values (2668) | 4375 |
| Value | Count | Frequency (%) | |
| Morgan Freeman | 18 | 0.4% | |
| Charlize Theron | 14 | 0.3% | |
| Brad Pitt | 13 | 0.3% | |
| Meryl Streep | 11 | 0.2% | |
| Adam Sandler | 10 | 0.2% | |
| James Franco | 10 | 0.2% | |
| Bruce Willis | 9 | 0.2% | |
| Scott Glenn | 9 | 0.2% | |
| Will Ferrell | 9 | 0.2% | |
| Kirsten Dunst | 8 | 0.2% | |
| Other values (2663) | 4330 | 97.5% |
Composition
| Contains chars | True |
|---|---|
| Contains digits | True |
| Contains whitespace | True |
| Contains non-words | True |
Length
| Max length | 28 |
|---|---|
| Mean length | 13.06282369 |
| Min length | 3 |
actor_3_facebook_likes
Real number (ℝ≥0)
| Distinct count | 901 |
|---|---|
| Unique (%) | 20.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 672.8203107 |
|---|---|
| Minimum | 0 |
| Maximum | 23000 |
| Zeros | 52 |
| Zeros (%) | 1.2% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 158 |
| median | 391 |
| Q3 | 650 |
| 95-th percentile | 1000 |
| Maximum | 23000 |
| Range | 23000 |
| Interquartile range (IQR) | 492 |
Descriptive statistics
| Standard deviation | 1699.183083 |
|---|---|
| Coefficient of variation (CV) | 2.525463421 |
| Kurtosis | 57.79872839 |
| Mean | 672.8203107 |
| Mean Absolute Deviation (MAD) | 584.2156617 |
| Skewness | 7.112386817 |
| Sum | 2987995 |
| Variance | 2887223.151 |
| Value | Count | Frequency (%) | |
| 1000 | 115 | 2.6% | |
| 0 | 52 | 1.2% | |
| 11000 | 27 | 0.6% | |
| 2000 | 25 | 0.6% | |
| 3000 | 24 | 0.5% | |
| 826 | 21 | 0.5% | |
| 249 | 19 | 0.4% | |
| 7 | 17 | 0.4% | |
| 51 | 16 | 0.4% | |
| 3 | 16 | 0.4% | |
| Other values (891) | 4109 | 92.5% |
| Value | Count | Frequency (%) | |
| 0 | 52 | 1.2% | |
| 2 | 13 | 0.3% | |
| 3 | 16 | 0.4% | |
| 4 | 15 | 0.3% | |
| 5 | 8 | 0.2% |
| Value | Count | Frequency (%) | |
| 23000 | 2 | < 0.1% | |
| 20000 | 1 | < 0.1% | |
| 19000 | 4 | 0.1% | |
| 17000 | 1 | < 0.1% | |
| 16000 | 3 | 0.1% |
actor_3_name
Categorical
| Distinct count | 3156 |
|---|---|
| Unique (%) | 71.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| Steve Coogan | 8 |
|---|---|
| Robert Duvall | 7 |
| Sam Shepard | 7 |
| Ben Mendelsohn | 7 |
| Stephen Root | 7 |
| Other values (3151) | 4405 |
| Value | Count | Frequency (%) | |
| Steve Coogan | 8 | 0.2% | |
| Robert Duvall | 7 | 0.2% | |
| Sam Shepard | 7 | 0.2% | |
| Ben Mendelsohn | 7 | 0.2% | |
| Stephen Root | 7 | 0.2% | |
| Jon Gries | 6 | 0.1% | |
| John Gielgud | 6 | 0.1% | |
| Lois Maxwell | 6 | 0.1% | |
| Thomas Lennon | 6 | 0.1% | |
| Bruce McGill | 6 | 0.1% | |
| Other values (3146) | 4375 | 98.5% |
Composition
| Contains chars | True |
|---|---|
| Contains digits | True |
| Contains whitespace | True |
| Contains non-words | True |
Length
| Max length | 27 |
|---|---|
| Mean length | 13.0565188 |
| Min length | 3 |
aspect_ratio
Real number (ℝ≥0)
| Distinct count | 20 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.104130384 |
|---|---|
| Minimum | 1.18 |
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 1.18 |
|---|---|
| 5-th percentile | 1.78 |
| Q1 | 1.85 |
| median | 2.2 |
| Q3 | 2.35 |
| 95-th percentile | 2.35 |
| Maximum | 16 |
| Range | 14.82 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.5009573553 |
|---|---|
| Coefficient of variation (CV) | 0.2380828484 |
| Kurtosis | 531.2858039 |
| Mean | 2.104130384 |
| Mean Absolute Deviation (MAD) | 0.2715259508 |
| Skewness | 19.13615632 |
| Sum | 9344.443036 |
| Variance | 0.2509582718 |
| Value | Count | Frequency (%) | |
| 2.35 | 2189 | 49.3% | |
| 1.85 | 1808 | 40.7% | |
| 2.104130384 | 146 | 3.3% | |
| 1.37 | 92 | 2.1% | |
| 1.78 | 63 | 1.4% | |
| 1.66 | 60 | 1.4% | |
| 1.33 | 31 | 0.7% | |
| 2.39 | 14 | 0.3% | |
| 2.2 | 13 | 0.3% | |
| 16 | 4 | 0.1% | |
| Other values (10) | 21 | 0.5% |
| Value | Count | Frequency (%) | |
| 1.18 | 1 | < 0.1% | |
| 1.2 | 1 | < 0.1% | |
| 1.33 | 31 | 0.7% | |
| 1.37 | 92 | 2.1% | |
| 1.5 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 16 | 4 | 0.1% | |
| 2.76 | 3 | 0.1% | |
| 2.55 | 2 | < 0.1% | |
| 2.4 | 3 | 0.1% | |
| 2.39 | 14 | 0.3% |
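In the common-values table above, 2.104130384 occurs 146 times and is identical to the reported mean of aspect_ratio. One possible explanation, which the report itself does not state, is that missing values were filled with the column mean before profiling; filling gaps with the mean of the observed values leaves the mean unchanged. The sketch below counts entries that coincide with the column mean as a rough check for this pattern. `count_at_mean` is a hypothetical helper.

```python
import math
from statistics import mean


def count_at_mean(values, rel_tol=1e-9):
    """Count entries that equal the column mean (within a relative tolerance)."""
    center = mean(values)
    return sum(1 for v in values if math.isclose(v, center, rel_tol=rel_tol))


observed = [1.0, 2.0, 6.0]                  # mean of the observed values is 3.0
filled = observed + [mean(observed)] * 3    # three gaps filled with that mean

print(count_at_mean(observed))  # no entry sits exactly at the mean
print(count_at_mean(filled))    # the three filled entries do
```

A large count on a continuous column, especially at a value with many decimals, is a hint worth checking against the raw data rather than proof of imputation.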
budget
Real number (ℝ≥0)
| Distinct count | 404 |
|---|---|
| Unique (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38284338.29 |
|---|---|
| Minimum | 218 |
| Maximum | 4200000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 218 |
|---|---|
| 5-th percentile | 1000000 |
| Q1 | 8000000 |
| median | 24000000 |
| Q3 | 42000000 |
| 95-th percentile | 125000000 |
| Maximum | 4200000000 |
| Range | 4199999782 |
| Interquartile range (IQR) | 34000000 |
Descriptive statistics
| Standard deviation | 99322701.58 |
|---|---|
| Coefficient of variation (CV) | 2.594342909 |
| Kurtosis | 899.8248003 |
| Mean | 38284338.29 |
| Mean Absolute Deviation (MAD) | 31683166.12 |
| Skewness | 25.93138816 |
| Sum | 1.700207463e+11 |
| Variance | 9.864999048e+15 |
| Value | Count | Frequency (%) | |
| 36541497.42 | 309 | 7.0% | |
| 20000000 | 166 | 3.7% | |
| 30000000 | 134 | 3.0% | |
| 25000000 | 133 | 3.0% | |
| 15000000 | 133 | 3.0% | |
| 40000000 | 128 | 2.9% | |
| 10000000 | 126 | 2.8% | |
| 35000000 | 117 | 2.6% | |
| 50000000 | 98 | 2.2% | |
| 5000000 | 97 | 2.2% | |
| Other values (394) | 3000 | 67.6% |
| Value | Count | Frequency (%) | |
| 218 | 1 | < 0.1% | |
| 1100 | 1 | < 0.1% | |
| 4500 | 1 | < 0.1% | |
| 7000 | 3 | 0.1% | |
| 9000 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4200000000 | 1 | < 0.1% | |
| 2500000000 | 1 | < 0.1% | |
| 2400000000 | 1 | < 0.1% | |
| 2127519898 | 1 | < 0.1% | |
| 1100000000 | 1 | < 0.1% |
cast_total_facebook_likes
Real number (ℝ≥0)
| Distinct count | 3722 |
|---|---|
| Unique (%) | 83.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10224.04346 |
|---|---|
| Minimum | 0 |
| Maximum | 656730 |
| Zeros | 9 |
| Zeros (%) | 0.2% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 313 |
| Q1 | 1585 |
| median | 3352 |
| Q3 | 14672 |
| 95-th percentile | 37645 |
| Maximum | 656730 |
| Range | 656730 |
| Interquartile range (IQR) | 13087 |
Descriptive statistics
| Standard deviation | 18035.4089 |
|---|---|
| Coefficient of variation (CV) | 1.764019194 |
| Kurtosis | 400.1457367 |
| Mean | 10224.04346 |
| Mean Absolute Deviation (MAD) | 10335.84546 |
| Skewness | 13.32508768 |
| Sum | 45404977 |
| Variance | 325275974.2 |
| Value | Count | Frequency (%) | |
| 0 | 9 | 0.2% | |
| 29 | 5 | 0.1% | |
| 2020 | 5 | 0.1% | |
| 1044 | 5 | 0.1% | |
| 1227 | 4 | 0.1% | |
| 2251 | 4 | 0.1% | |
| 646 | 4 | 0.1% | |
| 2 | 4 | 0.1% | |
| 1761 | 4 | 0.1% | |
| 2321 | 4 | 0.1% | |
| Other values (3712) | 4393 | 98.9% |
| Value | Count | Frequency (%) | |
| 0 | 9 | 0.2% | |
| 2 | 4 | 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 3 | 0.1% | |
| 6 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 656730 | 1 | < 0.1% | |
| 303717 | 1 | < 0.1% | |
| 263584 | 1 | < 0.1% | |
| 170118 | 1 | < 0.1% | |
| 140268 | 1 | < 0.1% |
color
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| Color | 4257 |
|---|---|
| Black and White | 184 |
| Value | Count | Frequency (%) | |
| Color | 4257 | 95.9% | |
| Black and White | 184 | 4.1% |
Composition
| Contains chars | True |
|---|---|
| Contains digits | False |
| Contains whitespace | True |
| Contains non-words | True |
Length
| Max length | 16 |
|---|---|
| Mean length | 5.455753209 |
| Min length | 5 |
content_rating
Categorical
| Distinct count | 15 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| R | 2027 |
|---|---|
| PG-13 | 1384 |
| PG | 666 |
| G | 109 |
| Not Rated | 99 |
| Other values (10) | 156 |
| Value | Count | Frequency (%) | |
| R | 2027 | 45.6% | |
| PG-13 | 1384 | 31.2% | |
| PG | 666 | 15.0% | |
| G | 109 | 2.5% | |
| Not Rated | 99 | 2.2% | |
| Unrated | 56 | 1.3% | |
| Approved | 54 | 1.2% | |
| X | 12 | 0.3% | |
| Passed | 9 | 0.2% | |
| NC-17 | 7 | 0.2% | |
| Other values (5) | 18 | 0.4% |
Composition
| Contains chars | True |
|---|---|
| Contains digits | True |
| Contains whitespace | True |
| Contains non-words | True |
Length
| Max length | 9 |
|---|---|
| Mean length | 2.759063274 |
| Min length | 1 |
country
Categorical
| Distinct count | 54 |
|---|---|
| Unique (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| USA | 3422 |
|---|---|
| UK | 398 |
| France | 130 |
| Canada | 100 |
| Germany | 90 |
| Other values (49) | 301 |
| Value | Count | Frequency (%) | |
| USA | 3422 | 77.1% | |
| UK | 398 | 9.0% | |
| France | 130 | 2.9% | |
| Canada | 100 | 2.3% | |
| Germany | 90 | 2.0% | |
| Australia | 49 | 1.1% | |
| Spain | 31 | 0.7% | |
| Japan | 18 | 0.4% | |
| Italy | 17 | 0.4% | |
| China | 17 | 0.4% | |
| Other values (44) | 169 | 3.8% |
Composition
| Contains chars | True |
|---|---|
| Contains digits | False |
| Contains whitespace | True |
| Contains non-words | True |
Length
| Max length | 14 |
|---|---|
| Mean length | 3.438639946 |
| Min length | 2 |
| Distinct count | 4441 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2374.020941 |
|---|---|
| Minimum | 0 |
| Maximum | 5042 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 231 |
| Q1 | 1155 |
| median | 2327 |
| Q3 | 3564 |
| 95-th percentile | 4674 |
| Maximum | 5042 |
| Range | 5042 |
| Interquartile range (IQR) | 2409 |
Descriptive statistics
| Standard deviation | 1412.260583 |
|---|---|
| Coefficient of variation (CV) | 0.5948812661 |
| Kurtosis | -1.144780233 |
| Mean | 2374.020941 |
| Mean Absolute Deviation (MAD) | 1216.970649 |
| Skewness | 0.09422882027 |
| Sum | 10543027 |
| Variance | 1994479.955 |
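A skewness near 0 with a kurtosis near -1.14 is what an almost evenly spread run of integers produces: a uniform distribution has an excess kurtosis of -1.2. This suggests the Kurtosis rows in this report are excess kurtosis (normal distribution = 0). The sketch below uses plain moment estimators; pandas applies bias corrections, so its output can differ slightly on small samples.

```python
def skewness_and_excess_kurtosis(values):
    """Moment-based skewness and excess kurtosis (no small-sample bias correction)."""
    n = len(values)
    mu = sum(values) / n
    m2 = sum((v - mu) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mu) ** 3 for v in values) / n  # third central moment
    m4 = sum((v - mu) ** 4 for v in values) / n  # fourth central moment
    return m3 / m2 ** 1.5, m4 / m2 ** 2 - 3


skew, kurt = skewness_and_excess_kurtosis(list(range(1000)))
print(f"skewness = {skew:.3f}, excess kurtosis = {kurt:.3f}")
```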
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 2604 | 1 | < 0.1% | |
| 4643 | 1 | < 0.1% | |
| 2596 | 1 | < 0.1% | |
| 549 | 1 | < 0.1% | |
| 4647 | 1 | < 0.1% | |
| 2600 | 1 | < 0.1% | |
| 553 | 1 | < 0.1% | |
| 557 | 1 | < 0.1% | |
| 2616 | 1 | < 0.1% | |
| Other values (4431) | 4431 | 99.8% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5042 | 1 | < 0.1% | |
| 5037 | 1 | < 0.1% | |
| 5035 | 1 | < 0.1% | |
| 5034 | 1 | < 0.1% | |
| 5033 | 1 | < 0.1% |
director_facebook_likes
Real number (ℝ≥0)
| Distinct count | 424 |
|---|---|
| Unique (%) | 9.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 739.654132 |
|---|---|
| Minimum | 0 |
| Maximum | 23000 |
| Zeros | 753 |
| Zeros (%) | 17.0% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 9 |
| median | 54 |
| Q3 | 212 |
| 95-th percentile | 1000 |
| Maximum | 23000 |
| Range | 23000 |
| Interquartile range (IQR) | 203 |
Descriptive statistics
| Standard deviation | 2928.865929 |
|---|---|
| Coefficient of variation (CV) | 3.959777688 |
| Kurtosis | 24.6824728 |
| Mean | 739.654132 |
| Mean Absolute Deviation (MAD) | 1154.941447 |
| Skewness | 4.993370888 |
| Sum | 3284804 |
| Variance | 8578255.629 |
| Value | Count | Frequency (%) | |
| 0 | 753 | 17.0% | |
| 6 | 57 | 1.3% | |
| 11 | 52 | 1.2% | |
| 7 | 52 | 1.2% | |
| 3 | 51 | 1.1% | |
| 2 | 50 | 1.1% | |
| 4 | 48 | 1.1% | |
| 10 | 47 | 1.1% | |
| 9 | 46 | 1.0% | |
| 13 | 45 | 1.0% | |
| Other values (414) | 3240 | 73.0% |
| Value | Count | Frequency (%) | |
| 0 | 753 | 17.0% | |
| 2 | 50 | 1.1% | |
| 3 | 51 | 1.1% | |
| 4 | 48 | 1.1% | |
| 5 | 42 | 0.9% |
| Value | Count | Frequency (%) | |
| 23000 | 1 | < 0.1% | |
| 22000 | 8 | 0.2% | |
| 21000 | 10 | 0.2% | |
| 18000 | 4 | 0.1% | |
| 17000 | 20 | 0.5% |
director_name
Categorical
| Distinct count | 2100 |
|---|---|
| Unique (%) | 47.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| Steven Spielberg | 26 |
|---|---|
| Woody Allen | 22 |
| Martin Scorsese | 20 |
| Clint Eastwood | 20 |
| Ridley Scott | 16 |
| Other values (2095) | 4337 |
| Value | Count | Frequency (%) | |
| Steven Spielberg | 26 | 0.6% | |
| Woody Allen | 22 | 0.5% | |
| Martin Scorsese | 20 | 0.5% | |
| Clint Eastwood | 20 | 0.5% | |
| Ridley Scott | 16 | 0.4% | |
| Renny Harlin | 15 | 0.3% | |
| Steven Soderbergh | 15 | 0.3% | |
| Spike Lee | 15 | 0.3% | |
| Tim Burton | 14 | 0.3% | |
| Oliver Stone | 14 | 0.3% | |
| Other values (2090) | 4264 | 96.0% |
Composition
| Contains chars | True |
|---|---|
| Contains digits | False |
| Contains whitespace | True |
| Contains non-words | True |
Length
| Max length | 32 |
|---|---|
| Mean length | 13.0689034 |
| Min length | 3 |
duration
Real number (ℝ≥0)
| Distinct count | 157 |
|---|---|
| Unique (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 108.81288 |
|---|---|
| Minimum | 20 |
| Maximum | 330 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 84 |
| Q1 | 94 |
| median | 104 |
| Q3 | 119 |
| 95-th percentile | 146 |
| Maximum | 330 |
| Range | 310 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 22.32140079 |
|---|---|
| Coefficient of variation (CV) | 0.2051356493 |
| Kurtosis | 12.19301058 |
| Mean | 108.81288 |
| Mean Absolute Deviation (MAD) | 15.77887457 |
| Skewness | 2.313159844 |
| Sum | 483238 |
| Variance | 498.2449332 |
| Value | Count | Frequency (%) | |
| 90 | 133 | 3.0% | |
| 100 | 127 | 2.9% | |
| 98 | 125 | 2.8% | |
| 101 | 122 | 2.7% | |
| 93 | 116 | 2.6% | |
| 99 | 115 | 2.6% | |
| 97 | 115 | 2.6% | |
| 94 | 111 | 2.5% | |
| 95 | 111 | 2.5% | |
| 107 | 102 | 2.3% | |
| Other values (147) | 3264 | 73.5% |
| Value | Count | Frequency (%) | |
| 20 | 1 | < 0.1% | |
| 25 | 1 | < 0.1% | |
| 37 | 1 | < 0.1% | |
| 45 | 1 | < 0.1% | |
| 53 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 330 | 1 | < 0.1% | |
| 325 | 1 | < 0.1% | |
| 300 | 1 | < 0.1% | |
| 293 | 1 | < 0.1% | |
| 289 | 1 | < 0.1% |
facenumber_in_poster
Real number (ℝ≥0)
| Distinct count | 19 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.363882009 |
|---|---|
| Minimum | 0 |
| Maximum | 43 |
| Zeros | 1890 |
| Zeros (%) | 42.6% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.012683133 |
|---|---|
| Coefficient of variation (CV) | 1.475701799 |
| Kurtosis | 57.95438239 |
| Mean | 1.363882009 |
| Mean Absolute Deviation (MAD) | 1.344092358 |
| Skewness | 4.65478897 |
| Sum | 6057 |
| Variance | 4.050893395 |
| Value | Count | Frequency (%) | |
| 0 | 1890 | 42.6% | |
| 1 | 1118 | 25.2% | |
| 2 | 639 | 14.4% | |
| 3 | 339 | 7.6% | |
| 4 | 181 | 4.1% | |
| 5 | 92 | 2.1% | |
| 6 | 64 | 1.4% | |
| 7 | 43 | 1.0% | |
| 8 | 34 | 0.8% | |
| 9 | 13 | 0.3% | |
| Other values (9) | 28 | 0.6% |
| Value | Count | Frequency (%) | |
| 0 | 1890 | 42.6% | |
| 1 | 1118 | 25.2% | |
| 2 | 639 | 14.4% | |
| 3 | 339 | 7.6% | |
| 4 | 181 | 4.1% |
| Value | Count | Frequency (%) | |
| 43 | 1 | < 0.1% | |
| 31 | 1 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 15 | 4 | 0.1% | |
| 14 | 1 | < 0.1% |
genres
Categorical
| Distinct count | 851 |
|---|---|
| Unique (%) | 19.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| Drama | 195 |
|---|---|
| Comedy | 177 |
| Comedy|Drama|Romance | 174 |
| Comedy|Drama | 167 |
| Drama|Romance | 143 |
| Other values (846) | 3585 |
| Value | Count | Frequency (%) | |
| Drama | 195 | 4.4% | |
| Comedy | 177 | 4.0% | |
| Comedy|Drama|Romance | 174 | 3.9% | |
| Comedy|Drama | 167 | 3.8% | |
| Drama|Romance | 143 | 3.2% | |
| Comedy|Romance | 142 | 3.2% | |
| Crime|Drama|Thriller | 89 | 2.0% | |
| Action|Crime|Thriller | 60 | 1.4% | |
| Action|Crime|Drama|Thriller | 59 | 1.3% | |
| Horror | 57 | 1.3% | |
| Other values (841) | 3178 | 71.6% |
Composition
| Contains chars | True |
|---|---|
| Contains digits | False |
| Contains whitespace | False |
| Contains non-words | True |
Length
| Max length | 64 |
|---|---|
| Mean length | 20.63476694 |
| Min length | 5 |
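Much of the cardinality reported above comes from the values being pipe-delimited combinations such as `Comedy|Drama|Romance`, so each distinct combination counts as its own level. Splitting on `|` gives per-genre counts instead. A minimal sketch (`genre_counts` is a hypothetical helper, and the sample rows reuse values from the table above):

```python
from collections import Counter


def genre_counts(combined_values):
    """Count individual genres across pipe-delimited genre strings."""
    counts = Counter()
    for entry in combined_values:
        counts.update(entry.split("|"))
    return counts


rows = ["Drama", "Comedy|Drama", "Comedy|Drama|Romance", "Crime|Drama|Thriller"]
for genre, count in genre_counts(rows).most_common():
    print(genre, count)
```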
gross
Real number (ℝ≥0)
| Distinct count | 3925 |
|---|---|
| Unique (%) | 88.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48652043.17 |
|---|---|
| Minimum | 162 |
| Maximum | 760505847 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 162 |
|---|---|
| 5-th percentile | 146402 |
| Q1 | 7825820 |
| median | 33000377 |
| Q3 | 55942830 |
| 95-th percentile | 170708996 |
| Maximum | 760505847 |
| Range | 760505685 |
| Interquartile range (IQR) | 48117010 |
Descriptive statistics
| Standard deviation | 63839429.28 |
|---|---|
| Coefficient of variation (CV) | 1.312163377 |
| Kurtosis | 17.02826811 |
| Mean | 48652043.17 |
| Mean Absolute Deviation (MAD) | 39847568.2 |
| Skewness | 3.30212485 |
| Sum | 2.160637237e+11 |
| Variance | 4.075472731e+15 |
| Value | Count | Frequency (%) | |
| 47644514.53 | 499 | 11.2% | |
| 8000000 | 3 | 0.1% | |
| 78900000 | 2 | < 0.1% | |
| 36000000 | 2 | < 0.1% | |
| 2000000 | 2 | < 0.1% | |
| 30400000 | 2 | < 0.1% | |
| 26400000 | 2 | < 0.1% | |
| 1000000 | 2 | < 0.1% | |
| 800000 | 2 | < 0.1% | |
| 25000000 | 2 | < 0.1% | |
| Other values (3915) | 3923 | 88.3% |
| Value | Count | Frequency (%) | |
| 162 | 1 | < 0.1% | |
| 703 | 1 | < 0.1% | |
| 721 | 1 | < 0.1% | |
| 828 | 1 | < 0.1% | |
| 1111 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 760505847 | 1 | < 0.1% | |
| 658672302 | 1 | < 0.1% | |
| 652177271 | 1 | < 0.1% | |
| 623279547 | 1 | < 0.1% | |
| 533316061 | 1 | < 0.1% |
imdb_score
Real number (ℝ≥0)
| Distinct count | 76 |
|---|---|
| Unique (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.432785409 |
|---|---|
| Minimum | 1.6 |
| Maximum | 9.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 1.6 |
|---|---|
| 5-th percentile | 4.4 |
| Q1 | 5.8 |
| median | 6.6 |
| Q3 | 7.2 |
| 95-th percentile | 8 |
| Maximum | 9.3 |
| Range | 7.7 |
| Interquartile range (IQR) | 1.4 |
Descriptive statistics
| Standard deviation | 1.099525537 |
|---|---|
| Coefficient of variation (CV) | 0.1709252629 |
| Kurtosis | 1.1148243 |
| Mean | 6.432785409 |
| Mean Absolute Deviation (MAD) | 0.8485388831 |
| Skewness | -0.7731944673 |
| Sum | 28568 |
| Variance | 1.208956406 |
| Value | Count | Frequency (%) | |
| 6.7 | 205 | 4.6% | |
| 6.6 | 184 | 4.1% | |
| 6.5 | 175 | 3.9% | |
| 6.4 | 174 | 3.9% | |
| 6.8 | 170 | 3.8% | |
| 7.2 | 168 | 3.8% | |
| 7.1 | 164 | 3.7% | |
| 6.1 | 162 | 3.6% | |
| 7.3 | 160 | 3.6% | |
| 6.3 | 158 | 3.6% | |
| Other values (66) | 2721 | 61.3% |
| Value | Count | Frequency (%) | |
| 1.6 | 1 | < 0.1% | |
| 1.7 | 1 | < 0.1% | |
| 1.9 | 3 | 0.1% | |
| 2 | 2 | < 0.1% | |
| 2.1 | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 9.3 | 1 | < 0.1% | |
| 9.2 | 1 | < 0.1% | |
| 9 | 2 | < 0.1% | |
| 8.9 | 5 | 0.1% | |
| 8.8 | 5 | 0.1% |
language
Categorical
| Distinct count | 37 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| English | 4213 |
|---|---|
| French | 50 |
| Spanish | 33 |
| Mandarin | 19 |
| German | 14 |
| Other values (32) | 112 |
| Value | Count | Frequency (%) | |
| English | 4213 | 94.9% | |
| French | 50 | 1.1% | |
| Spanish | 33 | 0.7% | |
| Mandarin | 19 | 0.4% | |
| German | 14 | 0.3% | |
| Hindi | 14 | 0.3% | |
| Japanese | 13 | 0.3% | |
| Portuguese | 8 | 0.2% | |
| Cantonese | 8 | 0.2% | |
| Italian | 8 | 0.2% | |
| Other values (27) | 61 | 1.4% |
Composition
| Contains chars | True |
|---|---|
| Contains digits | False |
| Contains whitespace | False |
| Contains non-words | False |
Length
| Max length | 10 |
|---|---|
| Mean length | 6.988966449 |
| Min length | 4 |
movie_facebook_likes
Real number (ℝ≥0)
| Distinct count | 798 |
|---|---|
| Unique (%) | 18.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7917.134204 |
|---|---|
| Minimum | 0 |
| Maximum | 349000 |
| Zeros | 2024 |
| Zeros (%) | 45.6% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 177 |
| Q3 | 5000 |
| 95-th percentile | 42000 |
| Maximum | 349000 |
| Range | 349000 |
| Interquartile range (IQR) | 5000 |
Descriptive statistics
| Standard deviation | 19864.23539 |
|---|---|
| Coefficient of variation (CV) | 2.509018399 |
| Kurtosis | 40.12413577 |
| Mean | 7917.134204 |
| Mean Absolute Deviation (MAD) | 11449.1077 |
| Skewness | 4.970180909 |
| Sum | 35159993 |
| Variance | 394587847.5 |
| Value | Count | Frequency (%) | |
| 0 | 2024 | 45.6% | |
| 1000 | 101 | 2.3% | |
| 11000 | 76 | 1.7% | |
| 10000 | 72 | 1.6% | |
| 13000 | 58 | 1.3% | |
| 12000 | 56 | 1.3% | |
| 2000 | 51 | 1.1% | |
| 15000 | 47 | 1.1% | |
| 16000 | 45 | 1.0% | |
| 14000 | 44 | 1.0% | |
| Other values (788) | 1867 | 42.0% |
| Value | Count | Frequency (%) | |
| 0 | 2024 | 45.6% | |
| 4 | 1 | < 0.1% | |
| 7 | 2 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 12 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 349000 | 1 | < 0.1% | |
| 199000 | 1 | < 0.1% | |
| 197000 | 1 | < 0.1% | |
| 191000 | 1 | < 0.1% | |
| 190000 | 1 | < 0.1% |
| Distinct count | 4441 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| http://www.imdb.com/title/tt1701990/?ref_=fn_tt_tt_1 | 1 |
|---|---|
| http://www.imdb.com/title/tt0119494/?ref_=fn_tt_tt_1 | 1 |
| http://www.imdb.com/title/tt0300214/?ref_=fn_tt_tt_1 | 1 |
| http://www.imdb.com/title/tt0387808/?ref_=fn_tt_tt_1 | 1 |
| http://www.imdb.com/title/tt0790724/?ref_=fn_tt_tt_1 | 1 |
| Other values (4436) | 4436 |
| Value | Count | Frequency (%) | |
| http://www.imdb.com/title/tt1701990/?ref_=fn_tt_tt_1 | 1 | < 0.1% | |
| http://www.imdb.com/title/tt0119494/?ref_=fn_tt_tt_1 | 1 | < 0.1% | |
| http://www.imdb.com/title/tt0300214/?ref_=fn_tt_tt_1 | 1 | < 0.1% | |
| http://www.imdb.com/title/tt0387808/?ref_=fn_tt_tt_1 | 1 | < 0.1% | |
| http://www.imdb.com/title/tt0790724/?ref_=fn_tt_tt_1 | 1 | < 0.1% | |
| http://www.imdb.com/title/tt0104431/?ref_=fn_tt_tt_1 | 1 | < 0.1% | |
| http://www.imdb.com/title/tt0102138/?ref_=fn_tt_tt_1 | 1 | < 0.1% | |
| http://www.imdb.com/title/tt1661382/?ref_=fn_tt_tt_1 | 1 | < 0.1% | |
| http://www.imdb.com/title/tt2103267/?ref_=fn_tt_tt_1 | 1 | < 0.1% | |
| http://www.imdb.com/title/tt0286244/?ref_=fn_tt_tt_1 | 1 | < 0.1% | |
| Other values (4431) | 4431 | 99.8% |
| Value | Count | Frequency (%) | |
| http | 4441 | 100.0% |
| Value | Count | Frequency (%) | |
| www.imdb.com | 4441 | 100.0% |
| Value | Count | Frequency (%) | |
| /title/tt0166195/ | 1 | < 0.1% | |
| /title/tt0365885/ | 1 | < 0.1% | |
| /title/tt2387559/ | 1 | < 0.1% | |
| /title/tt1542344/ | 1 | < 0.1% | |
| /title/tt0099422/ | 1 | < 0.1% | |
| /title/tt2183034/ | 1 | < 0.1% | |
| /title/tt0261392/ | 1 | < 0.1% | |
| /title/tt1034331/ | 1 | < 0.1% | |
| /title/tt0790736/ | 1 | < 0.1% | |
| /title/tt0156323/ | 1 | < 0.1% | |
| Other values (4431) | 4431 | 99.8% |
| Value | Count | Frequency (%) | |
| ref_=fn_tt_tt_1 | 4441 | 100.0% |
| Value | Count | Frequency (%) | |
| (empty) | 4441 | 100.0% |
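The five unlabeled tables above appear to break each URL into its components: scheme, host, path, query string and an empty fragment. The labels are an inference from the values; the report text does not name them. The standard library performs the same decomposition:

```python
from urllib.parse import urlsplit

# One of the URLs listed in the frequency table above.
url = "http://www.imdb.com/title/tt1701990/?ref_=fn_tt_tt_1"
parts = urlsplit(url)

print("scheme:  ", parts.scheme)
print("netloc:  ", parts.netloc)
print("path:    ", parts.path)
print("query:   ", parts.query)
print("fragment:", repr(parts.fragment))
```

Only the path differs between rows, which is why it is the only component with 4441 distinct values.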
movie_title
Categorical
| Distinct count | 4441 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| Donkey Punch | 1 |
|---|---|
| Strangerland | 1 |
| Capote | 1 |
| Vera Drake | 1 |
| Hotel for Dogs | 1 |
| Other values (4436) | 4436 |
| Value | Count | Frequency (%) | |
| Donkey Punch | 1 | < 0.1% | |
| Strangerland | 1 | < 0.1% | |
| Capote | 1 | < 0.1% | |
| Vera Drake | 1 | < 0.1% | |
| Hotel for Dogs | 1 | < 0.1% | |
| A Madea Christmas | 1 | < 0.1% | |
| Paddington | 1 | < 0.1% | |
| Seed of Chucky | 1 | < 0.1% | |
| Mrs Henderson Presents | 1 | < 0.1% | |
| Highlander | 1 | < 0.1% | |
| Other values (4431) | 4431 | 99.8% |
Composition
| Contains chars | True |
|---|---|
| Contains digits | True |
| Contains whitespace | True |
| Contains non-words | True |
Length
| Max length | 87 |
|---|---|
| Mean length | 16.28124296 |
| Min length | 2 |
num_critic_for_reviews
Real number (ℝ≥0)
| Distinct count | 527 |
|---|---|
| Unique (%) | 11.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 148.6437739 |
|---|---|
| Minimum | 1 |
| Maximum | 813 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 17 |
| Q1 | 62 |
| median | 120 |
| Q3 | 202 |
| 95-th percentile | 391 |
| Maximum | 813 |
| Range | 812 |
| Interquartile range (IQR) | 140 |
Descriptive statistics
| Standard deviation | 119.6388771 |
|---|---|
| Coefficient of variation (CV) | 0.8048697496 |
| Kurtosis | 2.990517209 |
| Mean | 148.6437739 |
| Mean Absolute Deviation (MAD) | 90.30978445 |
| Skewness | 1.52534826 |
| Sum | 660127 |
| Variance | 14313.46091 |
| Value | Count | Frequency (%) | |
| 81 | 30 | 0.7% | |
| 112 | 28 | 0.6% | |
| 97 | 28 | 0.6% | |
| 43 | 27 | 0.6% | |
| 25 | 27 | 0.6% | |
| 64 | 27 | 0.6% | |
| 50 | 26 | 0.6% | |
| 63 | 26 | 0.6% | |
| 29 | 26 | 0.6% | |
| 61 | 26 | 0.6% | |
| Other values (517) | 4170 | 93.9% |
| Value | Count | Frequency (%) | |
| 1 | 8 | 0.2% | |
| 2 | 12 | 0.3% | |
| 3 | 7 | 0.2% | |
| 4 | 9 | 0.2% | |
| 5 | 15 | 0.3% |
| Value | Count | Frequency (%) | |
| 813 | 1 | < 0.1% | |
| 775 | 1 | < 0.1% | |
| 765 | 1 | < 0.1% | |
| 750 | 1 | < 0.1% | |
| 739 | 1 | < 0.1% |
num_user_for_reviews
Real number (ℝ≥0)
| Distinct count | 953 |
|---|---|
| Unique (%) | 21.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 290.4807476 |
|---|---|
| Minimum | 1 |
| Maximum | 5060 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 83 |
| median | 174 |
| Q3 | 345 |
| 95-th percentile | 928 |
| Maximum | 5060 |
| Range | 5059 |
| Interquartile range (IQR) | 262 |
Descriptive statistics
| Standard deviation | 382.9447948 |
|---|---|
| Coefficient of variation (CV) | 1.318313857 |
| Kurtosis | 26.71061023 |
| Mean | 290.4807476 |
| Mean absolute deviation (MAD) | 231.5473951 |
| Skewness | 4.145034849 |
| Sum | 1290025 |
| Variance | 146646.7159 |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 26 | 26 | 0.6% | |
| 50 | 24 | 0.5% | |
| 39 | 21 | 0.5% | |
| 31 | 21 | 0.5% | |
| 53 | 21 | 0.5% | |
| 69 | 20 | 0.5% | |
| 90 | 20 | 0.5% | |
| 73 | 20 | 0.5% | |
| 32 | 20 | 0.5% | |
| 55 | 19 | 0.4% | |
| Other values (943) | 4229 | 95.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 5 | 0.1% | |
| 2 | 3 | 0.1% | |
| 3 | 8 | 0.2% | |
| 4 | 5 | 0.1% | |
| 5 | 6 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 5060 | 1 | < 0.1% | |
| 4667 | 1 | < 0.1% | |
| 4144 | 1 | < 0.1% | |
| 3646 | 1 | < 0.1% | |
| 3597 | 1 | < 0.1% |
num_voted_users
Real number (ℝ≥0)
| Distinct count | 4353 |
|---|---|
| Unique (%) | 98.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 90310.95632 |
|---|---|
| Minimum | 28 |
| Maximum | 1689764 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 28 |
|---|---|
| 5-th percentile | 1599 |
| Q1 | 12324 |
| median | 40346 |
| Q3 | 104301 |
| 95-th percentile | 351274 |
| Maximum | 1689764 |
| Range | 1689736 |
| Interquartile range (IQR) | 91977 |
Descriptive statistics
| Standard deviation | 142873.8382 |
|---|---|
| Coefficient of variation (CV) | 1.582021097 |
| Kurtosis | 23.23207544 |
| Mean | 90310.95632 |
| Mean absolute deviation (MAD) | 87557.4915 |
| Skewness | 3.935380646 |
| Sum | 401070957 |
| Variance | 2.041293365e+10 |
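The MAD row can be cross-checked against the quartiles. At least half of the observations lie between Q1 and Q3, so a median absolute deviation about the median cannot exceed the larger of (median - Q1) and (Q3 - median). The sketch below applies that bound to the num_voted_users figures; the helper name is hypothetical. The reported value exceeds the bound, which is consistent with a mean absolute deviation (for example, what `Series.mad()` returned in older pandas releases). That attribution is an assumption, since the report does not state how the figure was computed.

```python
def median_abs_dev_upper_bound(q1, median, q3):
    """Upper bound on the median absolute deviation about the median.

    At least half of the observations lie in [q1, q3], and each of them is
    within max(median - q1, q3 - median) of the median, so the median of the
    absolute deviations cannot exceed that distance.
    """
    return max(median - q1, q3 - median)


# Quartiles and MAD as reported for num_voted_users above.
q1, median, q3 = 12324, 40346, 104301
reported_mad = 87557.4915

bound = median_abs_dev_upper_bound(q1, median, q3)
exceeds_bound = reported_mad > bound
print(bound, exceeds_bound)
```

The same check gives the same outcome for the other numeric variables in this section.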
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 3119 | 3 | 0.1% | |
| 3665 | 3 | 0.1% | |
| 2541 | 3 | 0.1% | |
| 922 | 2 | < 0.1% | |
| 25332 | 2 | < 0.1% | |
| 23023 | 2 | < 0.1% | |
| 36108 | 2 | < 0.1% | |
| 27882 | 2 | < 0.1% | |
| 12980 | 2 | < 0.1% | |
| 1231 | 2 | < 0.1% | |
| Other values (4343) | 4418 | 99.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 28 | 1 | < 0.1% | |
| 48 | 1 | < 0.1% | |
| 50 | 1 | < 0.1% | |
| 53 | 2 | < 0.1% | |
| 60 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1689764 | 1 | < 0.1% | |
| 1676169 | 1 | < 0.1% | |
| 1468200 | 1 | < 0.1% | |
| 1347461 | 1 | < 0.1% | |
| 1324680 | 1 | < 0.1% |
plot_keywords
Categorical
| Distinct count | 4437 |
|---|---|
| Unique (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.8 KiB |
| Value | Count |
|---|---|
| one word title | 3 |
| based on novel | 3 |
| paralympics|quad rugby|rugby|team|wheelchair | 1 |
| 17th century|girl|maid|painter|painting | 1 |
| apprentice|demon|exorcism|master apprentice relationship|witch | 1 |
| Other values (4432) |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| one word title | 3 | 0.1% | |
| based on novel | 3 | 0.1% | |
| paralympics|quad rugby|rugby|team|wheelchair | 1 | < 0.1% | |
| 17th century|girl|maid|painter|painting | 1 | < 0.1% | |
| apprentice|demon|exorcism|master apprentice relationship|witch | 1 | < 0.1% | |
| baby|desert island|island|sequel|teenage girl | 1 | < 0.1% | |
| ejected from a moving vehicle|gun held to head|handcuffs|shot multiple times|strangulation | 1 | < 0.1% | |
| abdication|china|emperor|forbidden city|republic | 1 | < 0.1% | |
| cattle|cow|dairy farm|farm|rustler | 1 | < 0.1% | |
| mutant|superhero|superhero team|x men|year 1983 | 1 | < 0.1% | |
| Other values (4427) | 4427 | 99.7% |
Composition
| Contains chars | True |
|---|---|
| Contains digits | True |
| Contains whitespace | True |
| Contains non-words | True |
Length
| Max length | 149 |
|---|---|
| Mean length | 52.49268183 |
| Min length | 2 |
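Each plot_keywords value is a pipe-delimited list of keywords, which is likely why nearly every row is distinct (4437 of 4441). Below is a minimal sketch of splitting cells into individual keywords before counting them. It uses a handful of values copied from the table above, and the helper name `split_keywords` is hypothetical.

```python
from collections import Counter

# A few plot_keywords values copied from the table above.
rows = [
    "paralympics|quad rugby|rugby|team|wheelchair",
    "17th century|girl|maid|painter|painting",
    "baby|desert island|island|sequel|teenage girl",
    "mutant|superhero|superhero team|x men|year 1983",
    "one word title",
]


def split_keywords(value, sep="|"):
    """Split one pipe-delimited cell into individual, non-empty keywords."""
    return [kw.strip() for kw in value.split(sep) if kw.strip()]


# Count how often each individual keyword occurs across the sample rows.
keyword_counts = Counter(kw for row in rows for kw in split_keywords(row))
print(len(rows), len(keyword_counts))
```

Counting at the keyword level rather than the cell level would give a much lower cardinality than the per-row figure reported here.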
title_year
Real number (ℝ≥0)
| Distinct count | 88 |
|---|---|
| Unique (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2001.929971 |
|---|---|
| Minimum | 1927 |
| Maximum | 2016 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.8 KiB |
Quantile statistics
| Minimum | 1927 |
|---|---|
| 5-th percentile | 1978 |
| Q1 | 1998 |
| median | 2005 |
| Q3 | 2010 |
| 95-th percentile | 2015 |
| Maximum | 2016 |
| Range | 89 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 12.34025148 |
|---|---|
| Coefficient of variation (CV) | 0.006164177397 |
| Kurtosis | 6.832351141 |
| Mean | 2001.929971 |
| Mean absolute deviation (MAD) | 8.477429589 |
| Skewness | -2.213872958 |
| Sum | 8890571 |
| Variance | 152.2818065 |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2009 | 229 | 5.2% | |
| 2006 | 220 | 5.0% | |
| 2008 | 214 | 4.8% | |
| 2010 | 210 | 4.7% | |
| 2011 | 210 | 4.7% | |
| 2005 | 203 | 4.6% | |
| 2004 | 197 | 4.4% | |
| 2002 | 196 | 4.4% | |
| 2013 | 192 | 4.3% | |
| 2012 | 191 | 4.3% | |
| Other values (78) | 2379 | 53.6% |
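The "Other values" row follows from the ten rows above it together with the distinct count and the number of observations. A small sketch of that arithmetic, using only figures from this report (4441 observations is from the dataset overview); the helper name `frequency_pct` is hypothetical.

```python
# Figures copied from the title_year tables above.
n_obs = 4441            # number of observations (dataset overview)
distinct_count = 88     # "Distinct count" row
top_counts = [229, 220, 214, 210, 210, 203, 197, 196, 192, 191]


def frequency_pct(count, total=n_obs):
    """Share of observations, as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


other_distinct = distinct_count - len(top_counts)   # values not listed individually
other_count = n_obs - sum(top_counts)               # observations in those values
print(other_distinct, other_count, frequency_pct(other_count))
```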
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1927 | 1 | < 0.1% | |
| 1929 | 2 | < 0.1% | |
| 1930 | 1 | < 0.1% | |
| 1932 | 1 | < 0.1% | |
| 1933 | 2 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2016 | 72 | 1.6% | |
| 2015 | 152 | 3.4% | |
| 2014 | 184 | 4.1% | |
| 2013 | 192 | 4.3% | |
| 2012 | 191 | 4.3% |
First rows
| | actor_1_facebook_likes | actor_1_name | actor_2_facebook_likes | actor_2_name | actor_3_facebook_likes | actor_3_name | aspect_ratio | budget | cast_total_facebook_likes | color | content_rating | country | df_index | director_facebook_likes | director_name | duration | facenumber_in_poster | genres | gross | imdb_score | language | movie_facebook_likes | movie_imdb_link | movie_title | num_critic_for_reviews | num_user_for_reviews | num_voted_users | plot_keywords | title_year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1000.0 | CCH Pounder | 936.0 | Joel David Moore | 855.0 | Wes Studi | 1.78 | 237000000.0 | 4834 | Color | PG-13 | USA | 0 | 0.0 | James Cameron | 178.0 | 0.0 | Action|Adventure|Fantasy|Sci-Fi | 760505847.0 | 7.9 | English | 33000 | http://www.imdb.com/title/tt0499549/?ref_=fn_tt_tt_1 | Avatar | 723.0 | 3054.0 | 886204 | avatar|future|marine|native|paraplegic | 2009.0 |
| 1 | 40000.0 | Johnny Depp | 5000.0 | Orlando Bloom | 1000.0 | Jack Davenport | 2.35 | 300000000.0 | 48350 | Color | PG-13 | USA | 1 | 563.0 | Gore Verbinski | 169.0 | 0.0 | Action|Adventure|Fantasy | 309404152.0 | 7.1 | English | 0 | http://www.imdb.com/title/tt0449088/?ref_=fn_tt_tt_1 | Pirates of the Caribbean: At World's End | 302.0 | 1238.0 | 471220 | goddess|marriage ceremony|marriage proposal|pirate|singapore | 2007.0 |
| 2 | 11000.0 | Christoph Waltz | 393.0 | Rory Kinnear | 161.0 | Stephanie Sigman | 2.35 | 245000000.0 | 11700 | Color | PG-13 | UK | 2 | 0.0 | Sam Mendes | 148.0 | 1.0 | Action|Adventure|Thriller | 200074175.0 | 6.8 | English | 85000 | http://www.imdb.com/title/tt2379713/?ref_=fn_tt_tt_1 | Spectre | 602.0 | 994.0 | 275868 | bomb|espionage|sequel|spy|terrorist | 2015.0 |
| 3 | 27000.0 | Tom Hardy | 23000.0 | Christian Bale | 23000.0 | Joseph Gordon-Levitt | 2.35 | 250000000.0 | 106759 | Color | PG-13 | USA | 3 | 22000.0 | Christopher Nolan | 164.0 | 0.0 | Action|Thriller | 448130642.0 | 8.5 | English | 164000 | http://www.imdb.com/title/tt1345836/?ref_=fn_tt_tt_1 | The Dark Knight Rises | 813.0 | 2701.0 | 1144337 | deception|imprisonment|lawlessness|police officer|terrorist plot | 2012.0 |
| 4 | 640.0 | Daryl Sabara | 632.0 | Samantha Morton | 530.0 | Polly Walker | 2.35 | 263700000.0 | 1873 | Color | PG-13 | USA | 5 | 475.0 | Andrew Stanton | 132.0 | 1.0 | Action|Adventure|Sci-Fi | 73058679.0 | 6.6 | English | 24000 | http://www.imdb.com/title/tt0401729/?ref_=fn_tt_tt_1 | John Carter | 462.0 | 738.0 | 212204 | alien|american civil war|male nipple|mars|princess | 2012.0 |
| 5 | 24000.0 | J.K. Simmons | 11000.0 | James Franco | 4000.0 | Kirsten Dunst | 2.35 | 258000000.0 | 46055 | Color | PG-13 | USA | 6 | 0.0 | Sam Raimi | 156.0 | 0.0 | Action|Adventure|Romance | 336530303.0 | 6.2 | English | 0 | http://www.imdb.com/title/tt0413300/?ref_=fn_tt_tt_1 | Spider-Man 3 | 392.0 | 1902.0 | 383056 | sandman|spider man|symbiote|venom|villain | 2007.0 |
| 6 | 799.0 | Brad Garrett | 553.0 | Donna Murphy | 284.0 | M.C. Gainey | 1.85 | 260000000.0 | 2036 | Color | PG | USA | 7 | 15.0 | Nathan Greno | 100.0 | 1.0 | Adventure|Animation|Comedy|Family|Fantasy|Musical|Romance | 200807262.0 | 7.8 | English | 29000 | http://www.imdb.com/title/tt0398286/?ref_=fn_tt_tt_1 | Tangled | 324.0 | 387.0 | 294810 | 17th century|based on fairy tale|disney|flower|tower | 2010.0 |
| 7 | 26000.0 | Chris Hemsworth | 21000.0 | Robert Downey Jr. | 19000.0 | Scarlett Johansson | 2.35 | 250000000.0 | 92000 | Color | PG-13 | USA | 8 | 0.0 | Joss Whedon | 141.0 | 4.0 | Action|Adventure|Sci-Fi | 458991599.0 | 7.5 | English | 118000 | http://www.imdb.com/title/tt2395427/?ref_=fn_tt_tt_1 | Avengers: Age of Ultron | 635.0 | 1117.0 | 462669 | artificial intelligence|based on comic book|captain america|marvel cinematic universe|superhero | 2015.0 |
| 8 | 25000.0 | Alan Rickman | 11000.0 | Daniel Radcliffe | 10000.0 | Rupert Grint | 2.35 | 250000000.0 | 58753 | Color | PG | UK | 9 | 282.0 | David Yates | 153.0 | 3.0 | Adventure|Family|Fantasy|Mystery | 301956980.0 | 7.5 | English | 10000 | http://www.imdb.com/title/tt0417741/?ref_=fn_tt_tt_1 | Harry Potter and the Half-Blood Prince | 375.0 | 973.0 | 321795 | blood|book|love|potion|professor | 2009.0 |
| 9 | 15000.0 | Henry Cavill | 4000.0 | Lauren Cohan | 2000.0 | Alan D. Purwin | 2.35 | 250000000.0 | 24450 | Color | PG-13 | USA | 10 | 0.0 | Zack Snyder | 183.0 | 0.0 | Action|Adventure|Sci-Fi | 330249062.0 | 6.9 | English | 197000 | http://www.imdb.com/title/tt2975590/?ref_=fn_tt_tt_1 | Batman v Superman: Dawn of Justice | 673.0 | 3018.0 | 371639 | based on comic book|batman|sequel to a reboot|superhero|superman | 2016.0 |
Last rows
| | actor_1_facebook_likes | actor_1_name | actor_2_facebook_likes | actor_2_name | actor_3_facebook_likes | actor_3_name | aspect_ratio | budget | cast_total_facebook_likes | color | content_rating | country | df_index | director_facebook_likes | director_name | duration | facenumber_in_poster | genres | gross | imdb_score | language | movie_facebook_likes | movie_imdb_link | movie_title | num_critic_for_reviews | num_user_for_reviews | num_voted_users | plot_keywords | title_year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4431 | 830.0 | Mark Duplass | 224.0 | Katie Aselton | 10.0 | Bari Hyman | 2.10413 | 15000.0 | 1064 | Color | R | USA | 5021 | 157.0 | Jay Duplass | 85.0 | 0.0 | Comedy|Drama|Romance | 1.924670e+05 | 6.6 | English | 297 | http://www.imdb.com/title/tt0436689/?ref_=fn_tt_tt_1 | The Puffy Chair | 51.0 | 71.0 | 4067 | birthday|gift|motel|new york city|upholsterer | 2005.0 |
| 4432 | 407.0 | Sean Whalen | 91.0 | Jason Trost | 86.0 | Nick Principe | 2.35000 | 20000.0 | 674 | Color | Unrated | USA | 5024 | 91.0 | Jason Trost | 78.0 | 0.0 | Sci-Fi|Thriller | 4.764451e+07 | 4.0 | English | 835 | http://www.imdb.com/title/tt1836212/?ref_=fn_tt_tt_1 | All Superheroes Must Die | 42.0 | 35.0 | 1771 | arch villain|game of death|kidnapping|superhero | 2011.0 |
| 4433 | 462.0 | Divine | 143.0 | Mink Stole | 105.0 | Edith Massey | 1.37000 | 10000.0 | 760 | Color | NC-17 | USA | 5025 | 0.0 | John Waters | 108.0 | 2.0 | Comedy|Crime|Horror | 1.804830e+05 | 6.1 | English | 0 | http://www.imdb.com/title/tt0069089/?ref_=fn_tt_tt_1 | Pink Flamingos | 73.0 | 183.0 | 16792 | absurd humor|egg|gross out humor|lesbian|sex | 1972.0 |
| 4434 | 576.0 | Maggie Cheung | 133.0 | Béatrice Dalle | 45.0 | Don McKellar | 2.35000 | 4500.0 | 776 | Color | R | France | 5026 | 107.0 | Olivier Assayas | 110.0 | 1.0 | Drama|Music|Romance | 1.360070e+05 | 6.9 | French | 171 | http://www.imdb.com/title/tt0388838/?ref_=fn_tt_tt_1 | Clean | 81.0 | 39.0 | 3924 | jail|junkie|money|motel|singer | 2004.0 |
| 4435 | 5.0 | Fereshteh Sadre Orafaiy | 0.0 | Nargess Mamizadeh | 0.0 | Mojgan Faramarzi | 1.85000 | 10000.0 | 5 | Color | Not Rated | Iran | 5027 | 397.0 | Jafar Panahi | 90.0 | 0.0 | Drama | 6.737800e+05 | 7.5 | Persian | 697 | http://www.imdb.com/title/tt0255094/?ref_=fn_tt_tt_1 | The Circle | 64.0 | 26.0 | 4555 | abortion|bus|hospital|prison|prostitution | 2000.0 |
| 4436 | 291.0 | Shane Carruth | 45.0 | David Sullivan | 8.0 | Casey Gooden | 1.85000 | 7000.0 | 368 | Color | PG-13 | USA | 5033 | 291.0 | Shane Carruth | 77.0 | 0.0 | Drama|Sci-Fi|Thriller | 4.247600e+05 | 7.0 | English | 19000 | http://www.imdb.com/title/tt0390384/?ref_=fn_tt_tt_1 | Primer | 143.0 | 371.0 | 72639 | changing the future|independent film|invention|nonlinear timeline|time travel | 2004.0 |
| 4437 | 0.0 | Ian Gamazon | 0.0 | Edgar Tancangco | 0.0 | Quynn Ton | 2.10413 | 7000.0 | 0 | Color | Not Rated | Philippines | 5034 | 0.0 | Neill Dela Llana | 80.0 | 0.0 | Thriller | 7.007100e+04 | 6.3 | English | 74 | http://www.imdb.com/title/tt0428303/?ref_=fn_tt_tt_1 | Cavite | 35.0 | 35.0 | 589 | jihad|mindanao|philippines|security guard|squatter | 2005.0 |
| 4438 | 121.0 | Carlos Gallardo | 20.0 | Peter Marquardt | 6.0 | Consuelo Gómez | 1.37000 | 7000.0 | 147 | Color | R | USA | 5035 | 0.0 | Robert Rodriguez | 81.0 | 0.0 | Action|Crime|Drama|Romance|Thriller | 2.040920e+06 | 6.9 | Spanish | 0 | http://www.imdb.com/title/tt0104815/?ref_=fn_tt_tt_1 | El Mariachi | 56.0 | 130.0 | 52055 | assassin|death|guitar|gun|mariachi | 1992.0 |
| 4439 | 296.0 | Kerry Bishé | 205.0 | Caitlin FitzGerald | 133.0 | Daniella Pineda | 2.10413 | 9000.0 | 690 | Color | Not Rated | USA | 5037 | 0.0 | Edward Burns | 95.0 | 1.0 | Comedy|Drama | 4.584000e+03 | 6.4 | English | 413 | http://www.imdb.com/title/tt1880418/?ref_=fn_tt_tt_1 | Newlyweds | 14.0 | 14.0 | 1338 | written and directed by cast member | 2011.0 |
| 4440 | 86.0 | John August | 23.0 | Brian Herzlinger | 16.0 | Jon Gunn | 1.85000 | 1100.0 | 163 | Color | PG | USA | 5042 | 16.0 | Jon Gunn | 90.0 | 0.0 | Documentary | 8.522200e+04 | 6.6 | English | 456 | http://www.imdb.com/title/tt0378407/?ref_=fn_tt_tt_1 | My Date with Drew | 43.0 | 84.0 | 4285 | actress name in title|crush|date|four word title|video camera | 2004.0 |
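In the first and last rows, the leading unnamed column is the row position in the profiled data, while df_index appears to carry the index of an earlier, larger table. This is an assumption, as the report does not say how df_index was produced. Under that assumption, and taking df_index to be strictly increasing from 0, the difference between the two columns counts the index values that are absent up to a given row. The sketch below uses only the pairs visible in the tables above; the helper name is hypothetical.

```python
# (row position, df_index) pairs copied from the first and last rows above.
first_rows = [(0, 0), (1, 1), (2, 2), (3, 3), (4, 5),
              (5, 6), (6, 7), (7, 8), (8, 9), (9, 10)]
last_row = (4440, 5042)


def skipped_up_to(position, df_index):
    """Index values absent up to this row, assuming df_index is strictly
    increasing and the original index started at 0."""
    return df_index - position


# Index values missing within the span covered by the first ten rows.
seen = {idx for _, idx in first_rows}
missing_in_first_rows = sorted(set(range(first_rows[-1][1] + 1)) - seen)

total_skipped = skipped_up_to(*last_row)
print(missing_in_first_rows, total_skipped)
```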